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Two channels are considered; a discrete channel which can transmit se- 
quences of binary digits, and a continuous channel which can transmit band 
limited signals. The performance of a large number of simple signalling 
alphabets is computed and it is concluded that one cannot signal at rates 
near the channel capacity without using very complicated alphabets. 

INTRODUCTION 

C. E. Shannon's encoding theorems' associate with the channel of a 
communications system a capacity ('. These theorems show that the 
output of a message source can be encoded for transmission over the 
channel in such a way that the rate at which errors are made at the re- 
ceiving end of the system is arbitrarily small provided only that the 
message source produces information at a rate less than C bits per second. 
C is the largest rate with this property. 

Although these theorems cover a wide class of channels there are two 
channels which can serve as models for most of the channels one meets 
in practice. These are: 

1 . The binary channel 

This channel can transmit only sequences of binary digits and 1 
(which might represent hole and no hole in a punched tape; open-line 
and closed line; pulse and no pulse; etc.) at some definite rate, say one 
digit per second. There is a probability p (because of noise, or occasional 
equipment failure) that a transmitted is received as 1 or that a trans- 
mitted 1 is received as 0. The noise is supposed to affect different digits 
independently. The cpacity of this channel is 

C = 1 + p log p + (1 - p) log (1 - p) (1) 

bits per digit. The log appearing in Equation (1) is log to the base 2; 
this convention will be used throughout the rest of this paper. 

1 C. E. Shannon, "A Mathematical Theory of Communication," Bell System 
Tech. J., 27, p. 379-423 and pp. 623-656, 1948, theorems 9, 11, and 16 in particular. 
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2. The low-pass filler 

The second channel is an ideal low-pass filter which attenuates com- 
pletely all frequencies above a cutoff frequency W cycles per second and 
which passes frequencies below W without attenuation. The channel is 
supposed capable of handling only signals with average power P or less. 
Before the signal emerges from the channel, the channel adds to it a 
noise signal with average power N. The noise is supposed to be white 
Gaussian noise limited to the frequency band | v | < W. The capacity 
of this channel is 
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bits per second. 

Shannon's theorems prove that encoding schemes exist for signalling 
at rates near C with arbitrarily small rates of errors without actually 
giving a constructive method for performing the encoding. It is of some 
interest to compare encoding systems which can easily be devised with 
these ideal systems. In Part I of this paper some schemes for signalling 
over the binary channel will be compared with ideal systems. In Part 
II the same will be done for the low-pass filter channel. 

Part I 

THE BINARY CHANNEL 

1. Error-Correcting Alphabets 

Imagine the message source to produce messages which are sequences 
of letters drawn from an alphabet containing K letters. We suppose that, 
the letters are equally likely and that the letters which the source pro- 
duces at different times are independent of one another. (If the source 
given is a finite state source which does not fit this simple description, 
it can be converted into one which approximately does by a preliminary 
encoding of the type described in Shannon's Theorem 9.) To transmit 
the message over the binary channel we construct a new alphabet of 
K letters in which the letters are different sequences of binary digits of 
some fixed length, say D digits. Then the new alphabet is used as an en- 
coding of the old one suitable for transmission over the channel. For 
example, if the source produced sequences of letters from an alphabet 
of 3 letters, a typical encoding with D = 5 might convert the message 
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into a binary sequence composed of repetitions of the three letters. 

00000 

11100 

and 00111 

If K = 2 D , the alphabet consists of all binary sequences of length D 
and hence if any of the digits of a letter is altered by noise the letter will 
be misinterpreted at the receiving end of the channel. If K is somewhat 
smaller than 2 D it is possible to choose the letters so that certain kinds 
of errors introduced by the noise do not cause a misinterpretation at 
the receiver. For example, in the three letter alphabet given above, if 
only one of the five digits is incorrect there will be just one letter (the 
correct one) which agrees with the received sequence in all but one place. 
More generally if the letters of the alphabet are selected so that each 
letter differs from every other in at least 2k + 1 out of the D places, 
then when k or fewer errors are made the correct interpretation of the 
received sequence will be the (unique) letter of the alphabet which 
differs from the received sequence in no more than k places. An alphabet 
with this property will be called a k error correcting alphabet 2 . 

Error correcting alphabets have the advantage over the random 
alphabets which Shannon used to prove his encoding theorems that they 
are uniformly reliable whereas Shannon's alphabets are reliable only in 
an average sense. That is, Shannon proved that the probability that a 
letter chosen at random shall be received incorrectly can be made ar- 
bitrarily small. However, a certain small fraction of the letters of Shan- 
non's alphabets are allowed a much higher probability of error than the 
average. This kind of alphabet would be undesirable in applications such 
as the signalling of telephone numbers; one would not want to give a 
few subscribers telephone numbers which are received incorrectly more 
often than most of the others. It is only conjectured that the rate C can 
be approached using error correcting alphabets. The alphabets which are 
to be considered here are all error correcting alphabets. 

A geometric picture of an alphabet is obtained by regarding the D 
digits of a sequence as coordinates of a point in Euclidean D dimensional 
space. The possible received sequences are represented by vertices of 
the unit cube. A k error correcting alphabet is represented by a set of 
vertices, such that each pair of vertices is separated by a distance at 

least -%/2/c + 1 

Let Ko(D, k) be the largest number of letters which a D dimensional 



1 R. W. Hamming, "Error Detecting and Error Correcting Codes," Bell System 
Tech. J., 29, pp. 147-160, 1950. 



COMPARISON OF SIGNALLING ALPHABETS 507 

k error correcting alphabet can contain. Except when k = 1, there is no 
general method for constructing an alphabet with K (D, k) letters, nor 
is K (D, k) known as a function of D and k. Crude upper and lower 
bounds for K (D, k) are given by the following theorem. 
Theorem 1. The largest number of letters K (D, k) satisfies 

< K (D, k) < " - (3) 



N(D, 2k) ~ uv ' ' ~ N(D, k) 
where 

k 

N(D,k) = T,C D , r 

r=0 

is the number of sequences of D digits which differ from a given sequence 
in 0,1, ■ • • , or k places. 

Proof 

The upper bound is due to R. W. Hamming and is proved by noting 
that for each letter S of a A: error correcting alphabet there are N(D, k) 
possible received sequences which will be interpreted as meaning S. 
Hence N(D, k) K (D, k) < 2 D , the total number of sequences. 

The lower bound is proved by a random construction method. Pick 
any sequence Si for the first letter. There remain 2 D — N(D, 2k) se- 
quences which differ from Si in 2k + 1 or more places. Pick any one of 
these £ 2 for the second letter. There remain at least 2 D — 2N(D, 2k) 
sequences which differ from both Si and So in 2k + 1 or more places. 
As the process is continued, there remain at least 2 D — rN(D, 2k) 
sequences, which differ in 2k + 1 or more places from Si , • • • , S r , 
from which S r+i is chosen. If there are no choices available after choosing 
S K , then 2 D - KN(D, 2k) < so the alphabet (Si , ■ • ■ , S K ) has at 
least as many letters as the lower bound (3). 

For all the simple cases (D and k not very large) investigated so far 
the upper bound is a better estimate of Ko(D, k) than the lower bound. 
The upper and lower bounds differ greatly, as may be seen from a quick 
inspection of Table I. For example, in the case of a ten dimensional two 
error correcting alphabet, the bounds are 2.7 and 18.3. 

2. Efficiency Graph 

The first step in constructing an efficiency graph for comparing alpha- 
bets is to decide on what constitutes reliable transmission. The criterion 
used here is that on the average no more than one letter in 10 shall be 
misinterpreted. 
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Table I 
Table of 2 d /N(D, k) 
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Missing entries are numbers between 1 and 2. 

This sort of criterion might be appropriate for a channel transmitting 
English text. For other messages it is not always appropriate. For ex- 
ample, if the messages are telephone numbers, one would naturally 
require that the probability of mistaking a telephone number be small, 
say less than 10~ . If the telephone numbers are L decimal digits long, 
and if the alphabet has K different letters in it (so that it takes about 
L log 10/log K letters to make up a telephone number) the probability 
of making a mistake in a single letter should be required to be less than 
about 

10~ 4 log K 
L log 10 

which gives alphabets with large K an advantage over alphabets with 
small K. 

Since the probability that exactly r binary digits out of D shall be 
received incorrectly is C D , r p r (1 — p) ~ r , we achieve the required re- 
liability with a D-dimcnsional fc-error correcting alphabet provided p 
satisfies 



E cwa -p) D ~ r < io" 4 . 



(4) 



The value of p which makes the inequality hold with the equals sign 
determines the noisiest channel over which the alphabet can be used 
safely. 

Let K be the number of different letters in the alphabet. Then the 



COMPARISON OF SIGNALLING ALPHABETS 



509 



rate in bits per digit at which information is being recieved is 

\ogK 



R = 



D 



(5) 



In Equation (5) we have neglected a term which takes account of the 
information lost due to channel noise. This is legitimate because all but 
10 -4 of the letters are received correctly. 

The worst tolerable probability p of (4) and the rate R of Equation 
(5) determine the noise combating ability of an alphabet. To compare 
different alphabets one may represent them as points on an efficiency 
graph of R versus p. Fig. 1 is an efficiency graph on which the values 
(p, R) for a number of simple error correcting alphabets have been 
plotted. Each point on the graph is labelled with the two numbers k, D 
in that order. The alphabets represented were not found by any systema- 
tic process and are not all proved to be best possible (i.e., to have the 
largest K) for the stated values of k and D. Fortunately, R depends on 
K only logarithmically so that it is not likely the points representing the 
best possible alphabets lie far away from the plotted points. 

The solid line represents the curve 

R = c = 1 + p log p + (1 - p) log (1 - p). 

According to Shannon's theorems, all alphabets are represented by 
points lying below this line. 

The efficiency graph only partially orders the alphabets according to 
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PROBABILITY OF ERROR IN ANY DIGIT 

Fig. 1 — Probability of error in a letter is 10~ 4 . 
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their invulnerability to noise. For example, it is clear that the alphabet 
3, 15 is better than 2, 8. However, without further information about 
the channel, such as knowledge of p, there is no reasonable way of choos- 
ing between 3, 15 and 3, 7. 

3. Large Alphabets 

We have been unable to prove that there are error correcting alphabets 
which signal at rates arbitrarily close to C while maintaining an arbi- 
trarily small probability of error for any letter. A result in this direction 
is the following theorem. 

Theorem 2. Let any positive e and 8 be given. Given a channel with p < \ 
there exists an error correcting alphabet which can signal over the channel 
at a rate exceeding Ro — e where 

R = 1 + 2p log 2p + (1 - 2p) log (1 - 2p) 

bits per digit and for which the probability of error in any letter is less 
than 8. 

Proof 

The probability of error in any letter is the sum on the left of (4). This 
is a sum of terms from a binomial distribution which, as is well known, 
tends to a Gaussian distribution with mean Dp and variance Dp(l — 
p) for large D. Hence there is a constant A (5) such that all k error cor- 
recting alphabets with sufficiently large D have a letter error proba- 
bility less than 8 provided 

k>Dp + A(8) (Dp(l - p)) m (6) 

Let k(D) be the smallest integer which satisfies (6) and consider an 
alphabet which corrects k(D) errors and contains K (D, k(D)) letters. 
By Equation (5) and the lower bound of Theorem 1, this alphabet signals 
at a rate R(D) satisfying 

1 - I log N(D, 2k(D)) < R(D). 

Since p < \, 2k(D) < D/2 for large D and hence 

N(D, 2k(D)) < (2k(D) + 1)C D ,2*( D) . 

Then an application of Stirling's approximation for factorials shows that 
as D — > co 

1 - ^\ogN(D,2k(D))^R . 
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Hence by taking D large enough one obtains an alphabet with rate ex- 
ceeding R — e and letter error probability less than 5. 

The rate R appears on the efficiency graph as a dotted line. 

It has not been shown that no error-correcting alphabet has a rate 
exceeding R . In fact, one alphabet which exceeds R in rate is easy to 
construct. If the noise probability p is greater than \, then R a = 0. The 
alphabet with just two letters 

0000 .. .0 
and 

1 1 1 1 ... 1 

will certainly transmit information at a (small) positive rate, and with 
a 10 _ probability of errors if D is large enough, as long as p < \. 

Using a more refined lower bound for K (D, k) it might be shown that 
there are error-correcting alphabets which signal with rates near C. 
If one repeats the calculation that led to R using the upper bound 
(3) (which seems to be a better estimate of the true Ko(D, /.•)) instead 
of the lower bound (3), one is led to the rate C instead of R . 

The condition (4) is more conservative than necessary. The structure 
of the alphabet may be such that a particular sequence of more than 
k errors may occur without causing any error in the final letter. This is 
illustrated by the following simple example due to Shannon : the alphabet 
with just two letters 

000000 
1110 

corrects any single error but also corrects certain more serious errors 
such as receiving 1111 for 0. An alphabet designed for 
practical use would make efficient enough use of the available sequences 
so that any sequence of much more than k errors causes an error in the 
final letter; the random alphabets constructed above probably do not. 
If this kind of error were property accounted for, the rate R could be 
improved, perhaps to C. 

4- Other Discrete Channels 

If instead of transmitting just 0's and l's the channel can carry more 
digits 

0, 1,2, ••• ,n 
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a similar theory can be worked out. The simplest kind of noise in this 
channel changes a digit into any one of the n other possible numbers with 
probability p/n. Then the capacity of the channel is 

C = log (n + 1) + v ^g V - + (1 - p) log (1 - p). 

Error-correcting alphabets for this channel can also be constructed and 
the criterion (4) for good transmission remains unchanged. The proof 
of theorem 1 can be repeated with little change using 

k 

N{D, k) = Z <? , r n 

as the number of sequences which can be reached after k or fewer errors 
[the terms 2 D in (1) and (3) are replaced by (n + 1)° ]. Once more, using 
the lower bound, one finds an expression for R which is the same as the 
one for C but with p replaced by 2p. 

Part II 

THE LOW PASS FILTER 

1. Encoding and Detection 

If f(t) is a signal emerging from a low pass filter (so that its spectrum 
is confined to the frequency band \v\ < W cycles per second) then 
/(«) has a special analytic form given by the sampling theorem 3 

fit) = E /fjL^'^-*) (7) 

Thus the signal is completely determined by the sequence of sample 
values f(m/2W). The average power of the signal f(t) is measured by 

P = lim ± I fit) dl 

r-»co ^i J— t 
which can be expressed in terms of the sample values as follows 

p =^mJtAw)- (8) 

As in Part I, consider a message source producing a sequence of letters 
from an alphabet of K equally likely letters. To transmit this informa- 
tion over the low pass filter we must encode the sequence into a function 

3 C. E. Shannon, "Communication in the Presence of Noise," Proc. I. R. E., 
37, pp. 10-21, Jan. 1949. 
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f(t) of the form (7), or in other words into a sequence of sample values 
f(m/2W). To do this, we construct a new alphabet containing K letters 
which are different sequences of real numbers of some fixed length, 
say D places. When we let the letters of the new alphabet correspond to 
letters of the old one the message is translated into a sequence of real 
numbers which we use for the sequence f(m/2W) . 
If the K letters of the sequence alphabet are 

Si', an , • • • , aiD 



Sk ' O.R1 , • • • , &KD, 

the expression (8) for the average power of the function /(/•) becomes 

P - j~ (eg + A + ■ • • + 4) (9) 

where 

D 

d] = X) f 4 • 

1-1 

If the D numbers in the sequence Si are regarded as coordinates of a 
point in Euclidean D dimensional space, d'i represents the square of the 
distance from the point representing S% to the origin. 

When /(J) is transmitted, the received signal will be f(t) + n(t) where 
n(l) is some (unknown) white Gaussian noise signal. The noise signals 
n(t) are characterized by the fact that their sample values n(m/2W) 
are independently distributed according to Gaussian laws. That is, 



^("fer)^')-^ £•"*■* 



(10) 



The variance a 2 of the distribution of noise samples is, by an application 
of (8), the power of this ensemble of noise signals. 

At the receiving end of the channel, there is a detector which observes 
each block of D sample values f(m/2 W) + n(m/2W) and tries to decide 
which one of the K letters Si , • • • , S K was sent. In terms of the geo- 
metric picture, the detector divides all of D dimensional space into K 
non-overlapping regions Ui , • • • , U K with the property that, if the D 
received sample values are represented by a point in C7, , the detector 
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decides that St was sent. By Equation (10), the probability that the de- 
tector picks the wrong letter when *S',- is sent is 



*-&&*? I Lrf ***"**'''*• 



(id 



where U% is the set of all points not in Ui and r, is the distance from 
(lli , • • * > Vd) to the point representing S% . 

For any given alphabet the best possible detector (in the sense that it 
minimizes the average probability of making an error in guesssing a 
letter) is called a maximum likelihood detector. The region Ui for a maxi- 
mum likelihood detector consists of all points (?/i , • • • , y D ) which are 
closer to the point & than to any other letter point Sj{rt < Tj for all 
j ?* i) . To prove that this choice of Ui is best possible consider any other 
detector such that C7, contains a set V of points in which r» > Vj . A 
direct calculation shows that the detector obtained by removing V from 
Ui and making V part of Uj has a smaller probability of error per letter. 
The set of points equidistant from two given points is a hyperplane. The 
region Ui of a maximum likelihood detector is a convex region bounded 
by segments of the hyperplanes 

ri = n , n = r 2 , • • ■ . 

To compare signalling alphabets under the most favorable possible 
circumstances, we always compute letter error probabilities assuming 
that the detector is a maximum likelihood detector. 

2. Computation of error 'probabilities 

Exact evaluation of the letter error probability integral (11) is im- 
possible except in a few special cases. Fortunately we are only interested 
in (11) when a is small enough in comparison to the size of Ui to make the 
integral small. Then fairly accurate approximate formulas can be de- 
rived. 

Theorem 3. Let R i3 be the distance between letter points S t and Sj . Then 

1-11(1 -Q») <Vi<HQn (12) 



where 



Qa = -7c= [ e l2 ' 2 dx. 



The proof of Theorem 3 follows from the fact that Q i} is the prob- 
ability that, when S, is transmitted, the received sequence will be closer 
to Sj than to Si . 
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In the cases to be computed Qa is a rapidly decreasing function of 
Rij and the only terms worth keeping in (12) are the ones for which 
Rn is the smallest of the numbers Ra , • • • , fl« • Moreover since 
the Qij are all small enough so that the upper and lower bounds differ 
only by a few per cent, the upper bound is a good approximation to p,- . 
Then a simple approximate formula for the average letter error prob- 
ability p = (pi + • • ■ + Pk)/K is 

P = J 7^r e~ xl,i dx (13) 

where 2r is the smallest of the K(K — l)/2 distances Ra and N is the 
average over all letters in the alphabet of the number of letter points 
which are a distance 2r away. 

3. Efficiency graph 

The efficiency graph to be described was constructed originally to 
compare alphabets for signalling telephone numbers of length equal to 
ten decimal digits. It was desired that on the average only one telephone 
number in 10 4 should be received incorrectly. As described in Part I 
section 2, if the telephone numbers are encoded into sequences of letters 
from an alphabet, of K letters, we must require that the average prob- 
ability of error in any letter be 

p = 10" 5 login K (14) 

or smaller. 

(liven an alphabet, one can compute with the help of (13) and (II) 
and a table of the error integral the largest value of the noise power a" 
which can be tolerated. The average power of the transmitted signal is 
P given by liquation (9). Hence we can compute the smallest signal to 
noise ratio 

Y = P/<x 2 (15) 

which will be satisfactory. 

A letter containing log K bits of information is transmitted during 
an interval of 1) 211" seconds. Hence the rate at which information is 
received is 

R _ guy m 

bits per second. Again Equation (16) ignores a term representing in- 
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formation lost due to channel noise which is negligible because the error 
probability is low. 

The efficiency graph, Fig. 2, is a chart on which the signal to noise 
ratio Y in db [computed from Equation (15)] is plotted against the sig- 
nalling rate per unit bandwidth R/W = (2 log K)/D for different alpha- 
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Fig. 2— Probability is 10~ 4 that an error is made in a 10 digit decimal number. 
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bets. An alphabet is considered poor if its point on the efficiency graph 
lies far above the ideal curve R/W = C/W = log (1+7). 

4. The alphabets 

The alphabets which appear on the efficiency graph are the following: 

excess three (XSS) : the ten sequences of 4 binary digits which repre- 
sent 3, 4, • • • , and 12 in binary notation; 

two out of five: the ten sequences of five binary digits which contain 
exactly two ones; 

pulse position (PP10): the ten sequences of ten binary digits which 
contain exactly one one; 

2 D binary : all of 2 D sequences of D binary digits. 

pulse amplitude (PAn) : the 2n + 1 sequences of length 1 consisting 
of —n, — n + 1, • • • , n. This alphabet gives rise to a sort of quantized 
amplitude modulation. 

pulse length (PLn) : the n + 1 sequences of n binary digits of the form 
11 • • • 10 • • • 0, i.e., a run of ones followed by a run of zeros. 

Minimizing alphabets (K, D): The above alphabets are taken from 
actual practice. They are convenient because, aside from PAn, they 
require a signal generator with only two amplitude levels. If we ignore 
ease of generating the signals as a factor, a great many geometric ar- 
rangements of points suggest themselves as possible good alphabets. 
The principle by which one arrives at good alphabets may be described 
as follows. When a D and K have been determined which give the desired 
information rate R [by Equation (16)] try to arrange the K letter points 
in D dimensional space in such a way that the distances between pairs 
of points are all greater than some fixed distance and that the average 
of the K squared distances to the origin is minimized. By Equations 
(9) and (13) it is seen that, apart from the small influence of the factor 
N, this process must minimize the signal to noise ratio Y required. 

Ordinarily it is difficult to prove that a configuration is a minimizing 
one. Even to recognize a configuration which leads to a relative minimum 
(i.e. a minimum over all nearby configurations) is not always easy. The 
eight vertices of a cube, for example, do not give a relative minimum. 
Consequently, most of the alphabets to be described are only conjectured 
to be "best possible." Each of them satisfies one necessary requirement 
of minimizing alphabets that the centroid of the point configuration 
(assuming a unit mass at each letter point) lies at the origin. That this 
condition is necessary follows from the easily derived identity 

2 2 T"»2 

r 2 = n - 7?o 
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Fig. 3 — Two dimensional alphabets. 
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Fig. 4 — Three dimensional alphabets. 
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where n is the rms distance from the origin to the points of a configura- 
tion A, R is the distance from the origin to the centroid of A, and r 2 
is the rms distance from the points of A to the centroid of A. 

In plotting points on the efficiency graph the notation K, D is used 
for the best K-letter Z)-dimensional alphabet which has been found. 
The arrangement of points for various K, 2 and K, 3 alphabets is given in 
Figs. 3 and 4. In these figures two points are joined by a straight line 
if the distance between them is 1 (which is the value we have adopted 
for the minimum allowed separation 2r ). Although not shown, the origin 
is always at the centroid of the figure. To aid interpretation of these 
diagrams we have included Fig. 5 which demonstrates how all the signals 
of a typical alphabet can be generated. The functions of time shown in 
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Fig. 5 are not the code signals themselves but impulse functions which 
are to be passed through a low pass filter with cutoff at W c.p.s. to form 
the code signals. 

The best possible higher dimensional alphabets can be described more 
easily verbally than pictorially. In four dimensions we have found four 
alphabets. 

The 25,4 alphabet consists of the origin and all 24 points in 4 dimen- 
sional space having two coordinates equal to zero and the remaining 
two equal to \/y/2 or - l/\/2 . Each of the 24 points lies a unit distance 
away from the origin and its 10 other nearest neighbors; they are, in 
fact, the vertices of a regular solid. This alphabet has an advantage be- 
yond its high efficiency. The code signals are composed entirely of posi- 
tive and negative pulses of fixed energy and so should be easier to 
generate than most of the other codes which appear in this paper. 

The 800, 4 alphabet is constructed in the following way: Consider a 
lattice of points throughout the entire 4-dimensional space formed by 
taking all the linear combinations with integer coefficients of a basic 
set of four vectors. That is, the lattice points are of the form CiVi + 
C 2 v 2 + C3V3 + C4V4 where d , ■ ■ • , C 4 are integers and the v,- are the four 
given vectors. In connection with our problem it is of interest to know 
what lattice, (i.e. what choice of vi , v 2 , v 3 , v 4 ) has all lattice points 
separated at least unit distance from one another and at the same time 
packs as many points as possible into the space per unit volume. When 
a solution to this "packing problem" is known, it is clear that a good 
alphabet can be obtained just by using all the lattice points which are 
contained inside a hypersphere about the origin as the letter points. 
Many of the two dimensional alphabets illustrated in the sketches are 
related in this way to the corresponding two dimensional packing prob- 
lem (which is solved by letting v Y and v 2 be a pair of unit vectors 60° 
apart). A solution to the four dimensional packing problem is affored by 

Vl = V2' V2' °' ° 

V2== V2' °' V2' ° 

V3 = V2> °' °' V2 

Vi = °> V2' V2' °- 
This lattice contains two points per unit volume (twice as dense as the 
cubic lattice in which Vi , • ■ ■ , Vi are orthogonal to one another) and each 
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point has 18 nearest neighbors. A hypersphere of radius 3 about the 
origin has a volume (?r 2 /2)3 4 , about 400. Thus it contains about 800 
lattice points. Take these as the code points of the 800, 4 code. Their 
average squared distances from the origin can be estimated as 



f 

Jo 



r dr 



I r l dr 
Jo 



- i (3)! - 6 - 



N in Equation (13) may be estimated at 18; this is conservative because 
some lattice points outside the sphere are being counted. 

The two remaining four dimensional alphabets belong to two families 
of D-dimensional alphabets. 

The 4, 3; 5, 4; • • • ; D + 1, D ■ ■ • alphabets are the vertices of the 
simplest regular solid in D-dimensional space. For example, 4, 3 is a 
tetrahedron. Such a solid can be constructed from D + 1 vertices whose 
coordinates are the first D -f- 1 rows of the scheme 
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The vertices all lie a distance \/D/2(D + 1) from the centroid of the 
figure. 

6, 3; 8, 4; • • • ; 2D, D, • • ■ are obtained by placing a point wherever 
any positive or negative coordinate axis intersects the sphere of radius 
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1/V2 about the origin. Thus it follows that 6, 3 consists of the ver- 
tices of an octohedron. 

Error correcting alphabets ((k, K, D)): The error correcting alphabets 
discussed in Part I can be converted into good alphabets for this channel 
by replacing all digits which equalled by -1. Three error correcting 
alphabets appear on the chart; each is labelled by three numbers signi- 
fying (k, K, D). 

Slepian alphabets (SD): Using group theoretic methods, 1). Slepian 
has attempted to construct families of alphabets which signal at rates 
approaching C. Although this goal has not yet been reached, families 
of alphabets depending on the parameter D have been found which 
approach the ideal curve to within 6.2 db and then get worse as D — > w. 
In the simplest of these families of alphabets, D = 2m is even and the 
letters consist of all the 2" , C 2m . m sequences containing m zeros, the 
remaining places being filled by ±1. The best alphabet in this family 
is the one with D = 24. It lies 6.23 db away from the ideal curve and 
contains 1.1 x 10 10 letters. The alphabets of this family for D = 10, 24, 
and 70 appear on the efficiency graph labelled *S'10, 524, and £70. 

The conclusion to which one is forced as a result of this investigation is 
that one cannot signal over a channel with signal to noise level much less 
than 7 db above the ideal level of Equation (2) without using an un- 
believably complicated alphabet. No ten digit alphabet tolerates less 
than 7.7 db more than the ideal signal to noise ratio. 

It would be interesting to know more about good higher dimensional 
alphabets. They are very much more difficult to obtain. The regular 
solids, which provided some good alphabets in 3 and 4 dimensions, pro- 
vide nothing new in 5 or more dimensions; there are only three of them 
and they correspond to our D + 1, D; 21), D, and 2 D binary alphabets. 
Worse still, the packing problem also becomes unmanageable after 
dimension 5. 

ACKNOWLEDGMENT 

The author wishes to thank R. W. Hamming, L. A. MacColl, B. 
McMillan, C. E. Shannon, and D. Slepian for many helpful suggestions 
during the investigation summarized by this paper. 



